Back

The Journal of Molecular Diagnostics

24 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
FA-NIVA: A Nextflow framework for automated analysis of Nanopore based long-read sequencing data for genetic analysis in Fanconi anemia
2026-03-04 genetic and genomic medicine 10.64898/2026.02.27.26346867
Top 0.5% (1.9%)
Show abstract

MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...

2
Cancer genomic profiling predicts pathogenicity of BRCA1 and BRCA2 variants
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347746
Top 0.5% (1.9%)
Show abstract

Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...

3
Integrative screening identifies functional variants and VNTRs underlying GWAS signals at the 5p15.33 multi-cancer susceptibility locus
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26347427
Top 0.9% (1.5%)
Show abstract

Chromosome 5p15.33 harbors several independent association signals which demonstrate antagonistic pleiotropy across cancer types, with causal mechanisms largely unresolved. To identify functional variants and enhancer elements at this locus, we performed statistical fine-mapping followed by massively parallel reporter assays (MPRA) and proliferation based CRISPRi screens. This approach identified eight multi-cancer functional variants (MCFVs) across three GWAS signals. Targeting rs421629 (part o...

4
Molecular characterisation of a Klebsiella pneumoniae neonatal sepsis outbreak in a rural Gambian hospital: a retrospective genomic epidemiology investigation
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26347025
Top 2% (1.1%)
Show abstract

BackgroundKlebsiella pneumoniae is a common cause of neonatal sepsis in Africa, and is frequently hospital acquired. We recently reported an outbreak of multidrug-resistant K. pneumoniae sepsis amongst neonates at a rural hospital in The Gambia, West Africa, involving 57 cases and case fatality of 60%. Here we undertook a retrospective pathogen genomic epidemiology study of clinical and environmental K. pneumoniae isolated during the outbreak, to identify the outbreak strain, refine the epidemic...

5
Pan-cancer tumour classification and risk stratification from whole-genome somatic variants via dual-task representation learning
2026-03-04 genetic and genomic medicine 10.64898/2026.03.02.26347318
Top 2% (0.7%)
Show abstract

Tumour typing from whole-genome sequencing is increasingly accurate, yet molecular subtyping from somatic variants remains challenging because of tumour heterogeneity and inconsistent clinical annotations. Here, we present Mutation-Attention Dual-Task (MuAt2), a Transformer model that jointly classifies histological tumour types and subtypes directly from somatic single-nucleotide variants, indels and structural variants. MuAt2 leverages encoders pre-trained on 2,587 pan-cancer whole genomes, an...

6
Deep Learning-based Differentiation of Drug-induced Liver Injury and Autoimmune Hepatitis: A Pathological and Computational Approach
2026-03-06 pathology 10.64898/2026.03.05.26347708
Top 4% (0.5%)
Show abstract

Drug-induced liver injury (DILI) is an acute inflammatory liver disease caused not only by prescription and over-the-counter medications but also by health foods and dietary supplements. Typically, DILI patients recover once the causative substance is identified and discontinued. In contrast, autoimmune hepatitis (AIH) results from the immune-mediated destruction of hepatocytes due to a breakdown of self-tolerance mechanisms. Patients presenting with acute-onset AIH often lack characteristic cli...

7
Too rare to be random: genetic finding suggests previously unrecognized path of mutagenesis
2026-03-04 genetic and genomic medicine 10.64898/2026.03.03.26346966
Top 4% (0.4%)
Show abstract

We report a previously undescribed genotypic configuration identified in twins with HNRNPU-related neurodevelopmental disorder. Both twins have two closely spaced mosaic variants on the same allele that never co-occur on any single DNA molecule, resulting in three distinct cell lineages within each individual. We define this genotypic configuration as clustered monoallelic mosaicism (cMoMa). Recognizing the extreme improbability of such a configuration, we systematically explore two potential me...

8
Prospective Multicenter Evaluation of the QuickNavi-Campylobacter Assay in Stool Specimens
2026-03-04 infectious diseases 10.64898/2026.03.03.26346362
Top 5% (0.4%)
Show abstract

The rapid diagnosis of Campylobacter infections is important for the management of infectious gastroenteritis. Although stool culture is considered the gold standard, its sensitivity is limited and it requires prolonged incubation times. We performed a prospective multicenter study at nine healthcare facilities in Japan to evaluate a Campylobacter rapid antigen test using stool specimens between March 2024 and August 2025. Patients with suspected infectious gastroenteritis were consecutively enr...

9
Gene Portals: A Framework for Integrating Clinical, Functional, and Structural Evidence into Rare Disease Variant Classification
2026-03-06 genetic and genomic medicine 10.64898/2026.03.05.26347086
Top 6% (0.3%)
Show abstract

Rare Mendelian disorders affect 300-400 million people globally. Although genetic testing has become widely adopted, gene-specific evidence for tailored variant interpretation remains scattered across resources. We present Gene Portals, a framework for gene-centered multimodal knowledge bases that co-localize expert-harmonized clinical data, functional assays, population variation, structural annotations and gene-specific ACMG/AMP specifications within a single resource. A modular interface inte...

10
BEGA-UNet: Boundary-Explicit Guided Attention U-Net with Multi-Scale Feature Aggregation for Colonoscopic Polyp Segmentation
2026-03-05 gastroenterology 10.64898/2026.03.04.26347608
Top 7% (0.3%)
Show abstract

Accurate polyp segmentation from colonoscopy images is critical for colorectal cancer prevention, yet the generalization of deep learning models under domain shift remains insufficiently explored. We propose Boundary-Explicit Guided Attention U-Net (BEGA-UNet), a boundary-aware segmentation architecture that introduces explicit edge modeling as a structural inductive bias to enhance both segmentation accuracy and cross-domain robustness. The framework integrates three components: an Edge-Guided ...

11
Performance of an Optimized Methylation-Protein Multi-Cancer Early Detection (MCED) Test Classifier
2026-03-04 oncology 10.64898/2026.03.03.26347329
Top 7% (0.3%)
Show abstract

Multi-cancer early detection (MCED) tests can detect several cancer types and stages. We previously developed a methylation and protein (MP V1) MCED classifier. In this study, we present a refined MP V2 classifier, developed by evaluating model architectures that improved performance in prospectively enrolled case-control cohorts under standard testing conditions. The newly developed MP V2 classifier was trained to be more generalizable and achieve increased early-stage sensitivity at a target s...

12
Application of a Concise Video to Improve Patient Understanding of Tumor Genomic Testing in Community and Academic Practice Settings
2026-03-06 oncology 10.64898/2026.03.05.26347758
Top 7% (0.3%)
Show abstract

Purpose: Tumor genomic testing (TGT) is standard-of-care for most patients with advanced/metastatic cancer. Despite established guidelines, patient education prior to TGT is frequently omitted. The purpose of this study was to evaluate the impact and durability of a concise 3-4 minute video for patient education prior to TGT in community versus academic sites and across cancer types. Patients and Methods: Patients undergoing standard-of-care TGT were enrolled at a tertiary academic institution ...

13
Prediction of incident coronary artery disease in individuals with zero coronary artery calcium using a novel multi-ancestry, label-free polygenic risk score framework
2026-03-04 genetic and genomic medicine 10.64898/2026.03.02.26347474
Top 7% (0.3%)
Show abstract

BackgroundA coronary artery calcium (CAC) score of 0 is widely considered to indicate low short- to intermediate-term risk for coronary artery disease (CAD) and is frequently used to defer lipid-lowering therapy. However, a subset of individuals with CAC=0 still experience events, highlighting residual risk not captured by imaging alone. Polygenic risk scores (PRS) quantify lifelong inherited susceptibility, but conventional approaches rely on predefined ancestry labels despite human genetic div...

14
Association of the FTO rs9939609 variant with glycemic control
2026-03-05 genetic and genomic medicine 10.64898/2026.03.05.26347689
Top 8% (0.3%)
Show abstract

Type 2 diabetes (T2D) affects 11.1% of the global population, underscoring the need for biomarkers that inform treatment response and glycemic outcomes. We evaluated the association between the FTO variant rs9939609-A and glycemic control in a Mexican population. A total of 174 individuals living with T2D from Merida and Sisal, Yucatan, were included, of whom 85% were receiving oral hypoglycemic agents as main treatment. Glycemic control was defined cross-sectionally as good ([≤]130 mg/dL, n=...

15
HIPK4 is a novel gene associated with teratozoospermia and male infertility
2026-03-04 sexual and reproductive health 10.64898/2026.03.04.26346694
Top 8% (0.3%)
Show abstract

STUDY QUESTIONAre pathogenic variants in Homeodomain-interacting protein kinase (HIPK4) associated with sperm head abnormalities causing male infertility? SUMMARY ANSWERHIPK4 is a novel candidate gene associated with sperm head defects and human male infertility. WHAT IS KNOWN ALREADYNumerous genes causing male infertility due to Multiple Morphological Abnormalities of the sperm flagella (MMAF) have been described but the genetic basis of sperm head defects is less well understood. STUDY DESI...

16
NIR autofluorescence allows for pituitary gland detection during surgery: the first evidence from microscopic studies and in vivo measurements
Top 8% (0.3%)
Show abstract

A critical challenge in endocrine neurosurgery is intraoperative discrimination between normal pituitary tissue and pituitary neuroendocrine tumors (PitNETs). Suggesting the universal persistence of near-infrared autofluorescence (NIRAF) in endocrine organs and inspired by routine clinical use of NIRAF for parathyroid gland identification, we discovered that pituitary NIRAF can be employed for label-free transsphenoidal surgery guidance. Ex vivo confocal spectral imaging of 33 specimens identifi...

17
Cultryx: Precision Diagnostic Stewardship for Blood Cultures Using Machine Learning
2026-03-04 infectious diseases 10.64898/2026.02.27.26347214
Top 9% (0.3%)
Show abstract

BackgroundThe 2024 blood culture bottle shortage brought diagnostic resource allocation to the forefront, reflecting persistent, foundational challenges with low-value testing and empiric treatment approaches under clinical uncertainty. ObjectiveTo determine whether a machine learning approach using electronic medical record data can predict bacteremia more effectively than existing systems and practices to guide diagnostic testing and empiric treatment strategies. MethodsIn a retrospective co...

18
Stability of Microbiome-Derived Fatty Acids in Self-Collected Samples: A Comparative Evaluation of Stool and Blood Matrices
2026-03-06 gastroenterology 10.64898/2026.03.05.26347712
Top 9% (0.3%)
Show abstract

Background Short-chain fatty acids (SCFAs) are widely used as functional readouts of gut microbial activity in vivo. The growing adoption of decentralised study designs and self-collection protocols has amplified the need for reliable room-temperature storage and shipment strategies. However, SCFAs volatility and the persistence of post-collection microbial metabolism raise concerns regarding pre-analytical stability and the interpretability of measured concentrations. Methods We assessed the te...

19
OncoRAG: Graph-Based Retrieval Enabling Clinical Phenotyping from Oncology Notes Using Local Mid-Size Language Models
2026-03-06 oncology 10.64898/2026.03.05.26347717
Top 9% (0.3%)
Show abstract

Introduction: Manual data extraction from unstructured clinical notes is labor-intensive and impractical for large-scale clinical and research operations. Existing automated approaches typically require large language models, dedicated computational infrastructure, and/or task-specific fine-tuning that depends on curated data. The objective of this study is to enable accurate extraction with smaller locally deployed models using a disease-site specific pipeline and prompt configuration that are ...

20
Analysis Of Clinicopathological Histomorphological And Molecular Differences In Right And Left Sided Colonic Carcinoma
2026-03-04 health systems and quality improvement 10.64898/2026.03.03.26347325
Top 9% (0.3%)
Show abstract

BackgroundColorectal carcinoma (CRC) remains a significant cause of cancer morbidity and mortality worldwide. Right- and left-sided tumours differ in clinical, morphological, and molecular features. Microsatellite instability-high (MSI-H) tumours, often right-sided, are associated with distinct histopathological characteristics and prognostic implications. In Sri Lanka, molecular MSI testing is currently unavailable, highlighting the need for alternative predictive approaches. ObjectivesGeneral...